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ABSTRACT 

Recently there appears to have been an increase in 
interest in the relative power of general ability and narrower 
cognitive aptitudes to predict real world performance in training 
programs and on the job. This area has important practical 
implications for personnel selection and classification, particularly 
for large organizations such as the United States military which 
assigns people differentially to jobs based on patterns of measured 
abilities. This study ^ xamined the ability of Armed Services 
Vocational Aptitude Bat (ASVAB) , forms 8, 9, and 10 to predict 
training performance, us ^ technical jobs in the U.S. Navy with 
at least 900 people in each 30b. Performance was the measured 
criterion. For one job. Mess Management Specialist, performance was 
measured as final school grade. The other nine technical jobs were 
self-paced; the criterion was the number of hours required to 
complete training. General mental ability alone did about as well as 
differential weighting at the level of the three general aptitudes 
(quantitative, verbal, technical). Using all 10 specific aptitude 
tests as separate predictors increased validity by about eight 
percent. Future analysis will examine the specific values of beta 
weights and will be testing Hunter's (1983) path models for fit to 
the data from ind:vidual jobs. (ABL) 
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„UU,e p„»e. sene..l ,e«al aMU.y an. na„o.« co^niUve ap.i.uaes 
„ .eal »orla pe...r.a„ce In t.ainin, p»sr-s ana on *e Jo. 

, Tho.nai.., »86., Hunte. »86, . thU ,uasUon has l.por.ane 

.^ulaUons fo. *eo.Us o. .^an co^iclve a.UUies. I£ nar.owe. 

1 oKnirv then the status 
abllitia. ada nothing to prediction over general ability, 

„f narrower abilities within theories of ability will have to be 
.econ^lderea. Xn addition, It ha. Important praotlcal LpUoatlon. for 
personnel selection and classification, partlcnlarl, for Urge 
..ganuatlons ll.e the ..S. military which assign people differentially to 

lobs based on patterns of measured abilities. 

„f rh. relative predictive power of 
Data relevant to the question of the relatl 

.hinties comes from the civilian, as 
general ,.ental ability and narrower abilities com 

paper, however. Is concerned with research on military personnel. Recent 

.ppears to Indicate that general cognitive ability Is as good or better a 

mnirarv iob families as 
„ <n rralnine in most military j"" 
predictor of performance in training 

4f^rallv to predict success in particular 
ability composites derived specifically to pr 

famines. These findings are contrary to the current theory that Is 
.he fo^datlon of differential assignment of personnel to jo.s In the 

differential aptitude theory (or specific aptitude 
military. That theory, differential p 

.„rlruae factors assessea by particular 
theory), postulates that specific aptltuae 
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cognitive ability. 

Cas. m causal .l«e.e„cial apticuae .heory postulates t.at 

.,ee«U aptuu^s .a. a causa. ..act on pet.ot„a.cc ovet ana a.o. an, 

vauauus actoss ,o.s .ot ,ae. test ate usea, t.e causa. 
,,Uea a.«etentU. aptUuae t.eo. aoes not nt Ut.e sa.pU «U ta. 

en., aM..t, caus.n. pet.ot^ance .s .eneta. co^.t.ve aM..t, ana .n c. 
.pt.tuaes ate t.eese.ves causea .eneta. co.n.t.ve aM..t,. »o causa. 
patHs to pet.ot„ance .to„ e.t.et .eneta. apt.tuaes (Vet.a., Quantitative, 

. . 1 „, soecltlc apt.tuaes (aet.nea 
ana Technic..; ae£lnea by c.usters of tests) or specltl 

single tests, wete necessary to the aata. This theory o. the 
unaerlyln, processes caus.n. perlor«ance prea.cts that for ...tary .oh 

1 ..nitive ability would predict performance at least as 
families, general cognitive aDiiicy r 

. of sDecific aptitude derived to predict 
well as regression-based composites of specific p 

. • K f^mllv In his analyses, this prediction 
performance in the particular job family. 

was generally borne out. 

nrllltv of Classification 
'»^""''°"Vs-- in'the 'Unitary 

These imam^s have profouna l«p..cat.ons lor the utility ol the 
co^le. systees usea .n the military Cor the aUlerent.a. ass..n.ent ol 
personne.to,ohs(personne.c.ass.flcatlonsyste.s,. In oraer lor a 
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aspect of the system (i.e.. beyond the utility resulting fro. rejection of 
generally unpromising applicants), the regression equations for predicting 
performance must be different for different jobs or job families. In 
traditional military classification systems, both test scores and job 
performance have been standardized, and mean levels of job performance have 
been implicitly assumed to be equal across jobs (Brogden. 1959). Given 
these assumptions and Hunter's findings, if the validity of general 
cognitive ability is equal for all jobs or job families, then all (standard 
score) regression equations are identical, to wit: 



z - rz 



where: z - predicted performance in standard score form for applicant 1. 
^1 

r - the validity of general cognitive ability; and 

z - the standard score on general cognitive ability for 
'^i applicant i. 

Under these circumstances, personnel classification has no value over and 
above that of selection. The actual findings were not quite this drastic, 
because the validity of general mental ability was found to vary somewhat 
across job families. For example, the validity was found to somewhat lower 
for the Combat job family and for the Equipment Operator job family 
(Hunter. 1983). This variation in validity means that the standard score 
regression equations are not identical and that there is some small room 
left for differential assignment to have utility. (Differential assignment 
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based on a single predictor is called placement; Cronbach and Gleser, 
1965.) 

Nevertheless, Hunter's findings imply that the utility of military 
classification systems is at best only a small fraction of what it has been 
assumed to be. Because this implication is so profound, it is advisable to 
conduct further tests of his findi gs. 

Hunter's path analytic studies were conducted using average validities 
across all jobs for wliich validity data were available; these studies led 
to the prediction that general cognitive ability should have higher 
validity than regression-based composites of specific aptitudes for every 
job. However, individual jobs with sample sizes large enough to test thii 
prediction were not available. Because of this, Hunter (1983) used 
existing military job families as his unit of analysis, with each job 
family being represented by the average validities of the composites (and 
of general cognitive ability) across the jobs making up the job family. If 
the job families had been developed by a means that ensured that the jobs 
in each family would be homogeneous for aptitude requirements (if there 
were indeed such differential requirements) this approach would have had no 
problems. However the job families were originally created by a method 
that capitalizes heavily on chance configurations in the beta weights, 
i.e., that capitalizes on sampling error in the beta weights (Hunter 1983). 
Using samples that were often small in relation to the number of tests, 
jobs that had similar patterns of beta weights were grouped together into 
job families. To the extent that job families were based on random 
sampling error in the beta weights, the jobs in any given job family will 
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be a random sample of all jobs with respect to aptitude requirements (if 
there are differential aptitude requirements). Although complete 
randomness is almost certainly not the case, there is undoubtedly a 
substantial random component (as noted by Hunter, 1983), and this would 
serve to blur any underlying differential aptitude requirements. 

This problem can be solved by testing the differential aptitude theory 
using individual jobs that have very large samples. If Hunter's findings 
are verified using such a data base, his conclusions would appear to be 
unshakable and should then be incorporated into military human resources 
procedures and policies. This would probably entail: (1) seeking a new 
basis for classification utility in differences among jobs in average value 
to the military and in within-job variability of performance value to the 
military; (2) seeking new predictors that might create differential 
validity (e.g., psychomotor tests); and (3) a focus on increasing and 
measuring the utility of the selection component alone (which is 
substantial). On the other hand, if differential aptitude theory is 
supported, an obvious need will be the development of better methods of 
creating job families in order to take advantage of classification utility 
resulting from differential prediction. This follows from two facts which 
prevent the use of differential prediction at the level of individual jobs: 
(1) for many jobs, sample sizes will probably never be large enough for 
reliable determination of differential weights; and (2) even if the 
n^jcessary sample sizes were available, for practical reasons operationa?. 
classification systems must be based on a smaller number of job groupings 
(i.e. , job families) . 
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There are three levels of ability generality - specific aptitudes 
(assessed by individual tests), general aptitudes (assessed by clusters of 
tests identified by confirmatory factor analyses; e.g., verbal ability), 
and general cognitive ability (assessed by a cluster of general aptitudes 
identified by confirmatory factor analysis). Traditional 
operationalization of differential aptitude theory in the military has 
focused on specific aptitudes. Regression based composites of scores on 
individual tests have been derived to predict performance in individual 
jobs and job families. Assuming the validity of differential aptitude 
theory, this approacn would be satisfactory only if every test measured a 
different aptitude. If this were not the case, and if, for example, the 
Work Knowledge Test and the Paragraph Comprehension Test of the ASVAB both 
measured the same aptitude (verbal ability), then the beta weight for 
verbal aptitude would be split between the two tests in the regression 
analysis, with the specific split depending on sampling error in the 
validities and test i^itercorrelations (assuming equal test reliabilities). 
Since intercorrelations among tests measuring the same aptitude would be 
high, computed beta weights would have large standard errors, and would be 
unstable from sample to sample. Stability aside, they would also be 
difficult to interpret correctly (e.g., in our example, in terms of the 
importance of verbal aptitude). That is, if differential aptitude theory 
were valid at the level of general aptitudes but not valid at the level cf 
specific aptitudes, then beta weights for individual tests would be both 
unstable and difficult to interpret. 
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Under these circumstances, factors specific to general aptitudes, but 
not those specific to individual tests » would contribute to validity over 
and above the contribution of general cognitive ability. In such a case, 
the problems described above could be avoided by conducting the analyses at 
the level of general aptitudes, rather than at the level of individual 
tests. The expected finding would then be that regression-weighted 
composites of general aptitudes would predict performance better than 
general cognitive ability alone. Because of these considerations, 
differential ptitude theory should be tested at the level of general 
aptitudes as well as at the level of individurl tests. 

Method 

The tests used were the 10 subtests of The Armed Services Vocational 
Aptitude Battery (ASVAB) , forms 8, 9 and 10. The ASVAB was administered 
prior to entry into the Navy; all validities are predictive. These tests 
are described in Table 1; reliabilities are also given. 

The preliminary study reported here is based on 10 technical jobs in 
the U.S. Navy. The goal was to have data for 1,000 or more people for each 
job; however, two jobs were included with Ns of less than 1,000 (Ns of 958 
and 928). The jobs and their sample sizes are shown in Table 3. 

The criterion was performance ir Navy technical schools. For one job 
(Mess Management Specialist), performance was measured in the traditional 
manner: as final school grade (FSG). FSG is based on the average 
achievement tests administered during training. The other nine technical 
schools were self -paced; the criterion was the total number of hours 
required to complete training. 
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Because interest is in the predictive power of the tests in the 
applicant group, all validities were corrected for multivariate range 
restriction by the Navy. Intercorrelations among the tests were estimated 
on a sample of over 140,000 Navy applicants. 

The predictive power of specific aptitudes was estimated by multiple 
regression using all 10 tests. 

Hunter (1983; see also Kass et al., 1982) fjund that there are three 
general cognitive aptitudes underlying The ASVA3: Quantitative., Verbal and 
Technical. The tests that best measure these general aptitudes are: 
Q - AR + MK 
V - WK + GS 
T - MC + EI 

In this study, Q, V and T were measured by these standard score (equally 
weighted) sums. The predictive power of general aptitude theory was 
assessed by regressing criterion scores on Q, V and T composites. 
General mental ability is measured by the sum of the three standardized 
cognitive aptitude composites: 
G - Q + V + T, or 

G - (AR + MK) + (WK + GS) + (MC + EI) 
These relations are summarized in Table 2. 

The multiple Rs for the 10 tests and for the three general aptitude 
composites were shrunken using the Wherry formula (Catlin, 1980) to adjust 
for capitalization on chance. However, the sample sizes used in the 
shrinkage formula had to be adjusted to take into account the fact that the 
multivariate range restriction correction increases the standard error of 
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/ J « ^>,P effective sample size. For 
PfiHpnts which reduces the errecti. 
the validity coefficients, n „cr4mated 

«f thB average validity was estimatea 
each job. the sampling error variance of the g 

as Cr,/'^)'sl - Sl^. where: 

^ - the mean range corrected validity 
c 

^ - the mean uncorrected validity 

tn the range correction. 
S2 - the mean sampling error variance p'ior to 

2' V .amnling error variance of the range corrected 

S - the mean sampling cliwi. 
e 

c 

validities. 

The effective sample size 

for each iob as follows: 
(N ) was then computed for eacn j 



-2^2 ^ c2 
(1 - r ) + 

e 

c 



formuU is obtained by solving the standard sampling error variant 

for r for . T^e adjusted . were considerably smaUer than the 

. in Table 3 Sampling error variance is increased 
nominal Ns . as shown in Table , N is 

»^M-ons This adjustment to N is 

,„X, an app.»x...aa«., .uc no .o.e exacc «e..n<. appea. .n 

^^^^^^^^^ 

It should be noted that the wnerry 

^ «^ >,v usine the population weights 
correlation that would be produced by using 

. the exact values of those weights and not estimates of 

themselves, i.e.. the exact vaj 

^he exact population weights (which are 
them In real testing programs, .he exact pp 

. ^.^.ates a-e available, and since estimate 
optimal) are never known. Only estimates 
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depart from the true values, they produce a population correlation that is 
smaller. Tlius our results fot t*:e 10 tests and QVT are up^)er bound values 
for validities; in practice, smaller values would be expected. This is not 
true for G, because the correlation for G is a zero order correlation. 

Results and Discussio n 

The results are best summed up by the means at the bottom of Table 3. 
On the average, differentially weighting the V, Q and T composites using 
population regression weights is estimated to increase validity only by .01 
or 2%. This tiny increase may not even be replicable. Thus, on average, 
general mental ability alone does about as well as differential weighting 
at the level of the three general aptitudes. This finding might be 
surprising to those who would expsct that prediction could be enhanced for 
these technical jobs by assigning larger weights to Technical Ability (and 
perhaps also to Quantitative Ability). 

Using all 10 tests as separate predictors, as suggested by specific 
aptitude theory, increases validity by an average of .04 or 8%. While this 
increment may not seem Impressive, it is four times larger than the average 
QVT increment and, if real, would be of practical value. However, it is by 
no means certain at this point that the increment is real. TV is study is 
only preliminary, and we will be looking at more (and better) data in the 
future. Data to be analyzed in the future is based on traditional training 
school grades rather *"han tJme taken in training. Previous research 
indicates relations between test scores and traditional grades are linear 
or close to linear. This may not be true for the time taken in training 
measure » and so far it has not been possible to test for linearity in those 
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data. If there are violations of linearity, these, in combination with 
multivariate range restriction corrections, could distort findings. So 
more information is required before concluding that specific aptitude 
theory is suppor :ed over general aptitude theory and general ability 
theory . 

The above results are for observed scores. The more theoretically 
minded among you may wonder how the findings might look at the true '•core 
level. That is, if we were hypothetically able to measure each of the 10 
specific aptitudes with perfect reliability, and measure the Q, V, T and G 
factor.^ perfectly reliably, how would the results look? This aiialysis was 
conducted, correcting for unreliability in each of the 10 tests using the 
figures in Table 1. Confirmatory factov analyses were used to estimate the 
intercorrelation of factors and true score validity for Q. V, T and G. As 
expected, all correlations increased. However, the relative performance of 
specific aptitude theory, general aptitude theory, and general ab^''ity 
theory was very much the same as shown in Table 3 for observed scores. 

In future analysis, we will also be examining the specific values of 
beta weights and will be testing Hunter's (1983) path models for fit to the 
data from individual jobs. Analysis will be run for both observed and true 
scores . 

Because the data requirements for strong tests ol these three theoriej 
are very stringent, and because there are some complex and tricky 
statistical and measurement problems involved in these tests, we do not yet 
have final answe's. But we feel that progress has been madt% and will 
continue to be made. 
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T^le 1 

Predictor Variables Fcon ASVAB Forms 8, 9, and 10 



a b 
Test Abbreviation Reliability Description 



General Science GS .86 A 25- item test of knmledlgf. of the 

physical (13 itens) and biological 
(12 items) sciences- -11 ndiutes. 



Arlthnetlc Reasonii^ AR .91 

Iford Krmledjge IK .92 

Paragraph Conoprdiension PC .81 

Nunerlcal Operations NO .78 

Godlr^ Spee.-^ CD .85 

ALEto and Shop AS .87 
Information 

Mathematics Knowledge MC .87 

Hadianlcal (^cnprdiBnsion .85 

Electronics Information EI .82 



A 30- item test of ability to solve 
arlthnetlc wDrd problems- -3^' 
minutes. 

A 35- item test of knowledlge of vDcabu- 
lary, usli^ wrds mhedcted in sentences 
(11 items) and synonyms (24 items) 
11 minjtps. 

A 15-item test of readii^ caiprdiension 
--13 miiutes. 

A 50- item speeded test of ability to 
add, sd>tract, nuLtiply, and divide 
one- and tuo-dlgit Tuibers--3 miiiites. 

An 84- item speeded test of ability to 
recognize nunbers associated with words 
from a table--7 minutes. 

A 25- item test of knowledge of auto- 
mobiles, shop practices, and use of 
tools- -11 minutes. 

A 25- item test of knowledge of pl^sbca, 
gecroetry, fractions, decimals, and 
exponents- -24 minutes. 

A 25- item test of knowledge of mechani- 
cal and physical principles- -19 mimtes. 

A 20- item test of knowledge of elec- 
tronics, radio, and electrical princi- 
ples and information- -9 minutes. 



^'Reported a^ Navy Standard Scores havli)g a mean of about 50 and a standard deviation of 
10 for an isirestrlcted recruit population. 

-20 xellabilitles for pouer tests; parallel form reliabilities for No and CS; 
franKass et al. (1982). 
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Table 2 

Definitions of General Aptitudes and 
General Mental Ability 

Q _ AR + MK 

Vrr^r-I (r.^nreotual): 

V - WK + GS 
T - MC + EI 

M -n^^l AhUit^^: 

G - Q + V + T 
G + (AR + MK) + (WK + GS) + (MC + EQ) 



Note : 



See Table 1 for test names. 
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Table 3 

E«.timated Validities Using 10 Tests, 3 Aptitudes 
and General Mental Ability Alone 



T?b (ffl^ Titles) 

1. Mess Mgmt. Spec. (MS) 

2. Aviation Elec. Tech. (AT)^ 

3. Aviation Elec. Tech. (nT)^ 

4. Electronics Tech. (ET)^ 

5. Electronics Tech. (ET)^ 

6. Boiler Tech. (BT) 

7. Engineman (EN) 

8. Machinists Mate (MM) 

9. Aviation Elec. Mate (AE) 
10. Electricians Mate (EM) 



Sample 
Nominal 
1581 


Size 


b 


Adj.^ 
909 


10 Tests 
.59 


QVT 
.58 


G 

.56 


1489 


338 


.46 


.42 


.43 


2245 


679 


.78 


.75 


.75 


958 


112 


.70 


.66 


.64 


928 


/% "f 
207 


.61 


.61 


.59 


2085 




.48 


.44 


.44 


1258 


583 


.44 


.39 


.38 


2598 


1181 


.37 


.36 


.35 


1606 


651 


.54 


.53 


.53 


1109 


158 


.54 


.53 


.53 




Means 


.56 


.53 


.52 


Increase 


over G 


8% 


2% 





»A^<.,.r-.d M the efterri N allowing tor the Increase In the standard error of the 
C'lldHtes rS^ltlng tr .Ve correLlons for .ultlvarlate range restriction. 



''Multiple Rs for the IC 

*^'**Independent samples r 
unknowr, . 



nid QVT are shrunken using the Wherry formula, 
-amc job; the explanation for the disparities is 
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